4 


AD-A032  318 


unclassified 


AD 

A0323IS 


SAINT  LOUIS  UNIV  MO  DEPT  OF  EARTH  AND  ATMOSPHERIC  S— ETC  F/G  4/2 
RESEARCH  TO  DEVELOP  IMPROVED  MODELS  OF  CLIMATOLOGY  THAT  WILL  AS— ETC (U) 
AUG  76  D E MARTIN  F19628-74-C-0004 

AFGL-TR-76-0249  NL 


IDA032318 


AFGL -TR-76-0249 


RFSEARCH  to  develop  improved  models  of 
CLIMATOLOGY  THAT  WILL  ASSIST  THE  METEOROLOGIST  IN 
THE  TIMELY  OPERATION  OF  THE  AIR  FORCE 
WEATHER  DETACHMENTS 


Donald  E.  Martin 


Saint  Louis  University 

Department  of  Earth  and  Atmospheric  Sciences 
St.  Louis,  Missouri  63103 


Final  Report 

1 July  1975  - 31  August  1976 


31  August  1976 


Approved  for  public  release;  distribution  unlimited 


AIR  FORCE  GEOPHYSICS  LABORATORY 
AIR  FORCE  SYSTEMS  COMMAND 
UNITED  STATES  AIR  FORCE 
HANSCOM  AFB,  MASSACHUSETTS  01731 


Qualified  requestors  may  obtain  additional  copies  from  the  Defense 
Documentation  Center.  All  others  should  apply  to  the  National 
Technical  Information  Service. 


. «,•  Mb*.  4 + 


SECURITY  CLASSIFICATION  OF  THIS  RAGE  (Whmn  P.C  g nl,r,d) 

REPORT  DOCUMENTATION  PAGE  befoERAEDcom^e^gNform 

< REBAR  t NUMBER  |2.  GOVT  ACCESSION  NO.  3 RECIPI  ENT'S  C AT  ALOG  NUMBER 


I REPORT  DUMBER  2.  GOVT  ACCESSION  NO.  3 RECIPIENT'S  CATALOG  NUMBER 

, : AFGI>-TR-76-0249  ) ^ 

*.  title  (and  SubitiU)  / RESEARCH  TO  DEVELOP  IMPROVED  MODELS  5 REPORT  * PEmoo CO'/EPEO 

np  rT.TMATOTnoY  th at  WTT.i.  asstst  thp  MPTPnRnmoT rt  Final  Report  / J 


OF  CLIMATOLOGY  THAT  WILL  ASSIST  THE  METEOROLOGIST 
IN  THE  TIMELY  OPERATION  OF  THE  AIR  FORCE  WEATHER 
DETACHMENTS . 


Covers  life  of  contract. 

« PERFORMING  ORG.  REPORT  NUMBER 


7.  AUTHOR!*) 

^Donald  E.  Martin 

"i  PERFORMING  ORGANIZATION  NAME  AND  ADDRESS 

Saint  Louis  University 

Dept,  of  Earth  and  Atmospheric  Sciences 
St.  Louis,  MO  63103 

II  CONTROLLING  OFFICE  NAME  AND  ADDRESS 

Air  Force  Geophysics  Laboratory 
Hanscom  AFB , MA.  01731 

Contract  Monitor:  I.  I.  Gringorten,  LYK 

l«  MONITORING  AGENCY  NAME  & ADDRESS (it  <lllt,r»nt  from  Controlling  Ollt c*J 


t GRANT  NUMBER!*) 


F 19628-74 -C-000 4 


10.  PROGRAM  ELEMENT.  PROJECT,  TASK 
AREA  a WORK  UNIT  NUMBERS 

Proj.ect,  Task,  Work  Unit 
86240201,  DOD  62101F 
Sub  Element  'C8&624 

12.  REPORT  OATE 

31  Aug  1976 

H.  NUMBER  OF  PAGES 

, 55  f - . 

15.  SECURITY  CLASS,  (ot  ttilM  T»PQft) 

Unclassified 


/7M 


15a.  DECLASSIFICATION/ DOWNGRADING 
SCHEDULE 


I 16  DISTRIBUTION  STATEMENT  (of  this  Report) 


Approved  for  public  release;  distribution  unlimited. 


\V7.  DISTRIBUTION  STATEMENT  (of  the  abstract  entered  in  Block  20,  if  different  from  Raporf.) 


i is  supplementary  notes 


[l9.  KEY  WORDS  ( Continue  on  reverse  aide  if  neceeeary  and  identify  by  block  number ) 


Unconditional  Climatology 
Mission  Success  Indicators 
Joint  Probabilities 


20.  ABSTRACT  ( Continue  on  reverse  side  If  necessary  end  identify  by  block  number) 

This  research  provides  a means  of  modelling  the  joint  relationships  of  ceiling 
and  visibility  probabilities  among  stations  as  a function  of  distance  and  season. 
Analytic  formulations  are  presented  which  show  definite  skill  over  the  assumption 
of  data  independency.  These  functions  carry  the  added  advantage  of  being 
applicable  for  any  route  irrespective  of  direction,  topography  or  location. 

A preliminary  attempt  has  also  been  made  to  compact  the  unconditional  prob-  — v 


dd 


EDITION  OF  I NOV  65  IS  OBSOLETE 


UNCLASSIFIED ^ '"'J 

SECURITY  CLASSIFICATION  OF  THIS  PAGE  f»7,*n  Dmt*  Enl*,*dJ 

y . 


TABLE  OF  CONTENTS 


Page 

I.  INTRODUCTION 1 

II.  MISSION  SUCCESS  INDICATORS 2 

1)  NOTATIONS 2 

2)  DEFINITIONS  AND  EXAMPLES 2 

a)  CMSI  BASED  UPON  THE  ASSUMPTION  OF 

INDEPENDENCY  (CMSI*) 2 

b)  CMSI  WHICH  ALLOW  FOR  DATA  DEPENDENCIES  CMSIt.  3 

3)  CMSI  MODELLING  EFFORTS 5 

a)  THE  K-FACTOR 6 

b)  THE  M- FACTOR 6 

c)  THE  ALPHA  FACTOR 9 

d)  THE  BILLIKEN  FACTOR  (BF)  20 

4)  VERIFICATIONS  ON  INDEPENDENT  DATA 2 7 

5)  REVISED  ANALYTIC  FUNCTIONS 30 

6)  VERIFICATIONS  ON  DEPENDENT  DATA 32 

7)  CONCLUSION 32 

III.  MODELLING  THE  RUSSWO  DATA 36 

REFERENCES 47 


iii 


Figure  Pape 

1 Intersection  and  union 5 

2 K-factors  for  0000  Local  Time  in  January....  7 

3 Relationship  between  CMSIt  and  CMSI^  for  the 

winter  season  as  a function  of  distance  ....  8 


4 M-factors  for  expressing  meteorological 

homogeneity  for  ceiling  conditions  less 
than  200'  and/or  visibilities  less  than  1/2 
mile  as  a function  of  distance  and  month 10 

5a  Graph  of  CMSI-t  (heavy  curved  lines)  for 

Lakenheath-Mildenhall  and  Mildenhall- 
Lakenheath  comprising  the  distance  interval 
of  0-50  miles  for  all  months  of  the  year....  11 

5b  Graph  of  CMSI^  (heavy  curved  lines)  for 

Ramstein-Rhein  Main,  Rhein  Main-Ramstein 
Dover-McGuire , McGuire-Dover  and  Andrews- 
Dover  comprising  the  distance  interval  of 
50-100  miles  for  all  months  of  the  year 12 

5c  Graph  of  CMSI^  (heavy  curved  lines)  for 

Pope-Charleston,  S.C.,  Charleston-Robins , 
Robins-Dobbins  , Benning-Robin , Ft.  Hood- 
Bergstrom,  Altus-Tinker,  and  Dobbins- 
Robins  comprising  the  distance  interval  of 
100-200  miles  for  all  months  of  the  year...  13 

5d  Graph  of  CMSIt  (heavy  curved  lines)  for 

Travis- Alameda , Hill-Mt.  Home,  Eielsor.- 
Elmendorf,  Rota-Torre jon , Moron-Torre jon 
and  Torre jon -Moron  comprising  the  distance 
interval  of  200-300  miles  for  all  montns  of 


the  year 14 

6 Schematic  representation  of  the  Alpha- 

factor 15 

7 Values  of  oCas  a function  of  distance  for 

winter,  summer  and  the  annual  mean 19 


iv 


M, U*.  J.  + 


List  of  Illustrations 

Figure  Page 

8 B-Factors  for  expressing  meteorological 

homogeneity  ceilings  less  than  200*  and/or 
visibilities  less  than  1/2  mile  as  a func- 
tion of  distance  and  month 2 3 

9 Plot  of  Billiken  factor  against  distance 

for  winter,  summer  and  the  annual  mean 24 

10  Billiken-factor  isolines  for  constant  mileages 

plotted  against  the  month  of  the  year 2 5 

11  Variation  of  the  maximum  amplitude  term  of  the 

Billiken  function  with  distances 2 6 

12  Relative  accuracy  of  the  -method  and  BF- 

method  during  the  month  of  Jan 28 

13  Relative  accuracy  of  the  <A -method  and  BF- 
method  for  estimating  CMSI  during  the  month 

of  April 2 9 

14  Unconditional  probabilities  o^  ceilings  less 
than  1000  ft  and/or  visibilities  less  than 

one  mile  at  Otis  AFP 37 

15  Unconditional  probabilities  of  ceilings  less 

than  500  feet  and/or  visibilities  less  than 
one-half  mile  at  Otis  AFR 38 

16a  Estimated  probabilities  of  ceiling/ visibility 

200/M  for  upper  Heyford,  England  using  the 
equation  P200/M  = P500/M  + Bi(P1000/l  - P500/M).  40 

16b  Estimated  probabilities  of  ceiling/ visibility 

200/M  for  upper  Heyford,  England  using  the 
equation  P200/M  = PS00/M  + Bj  (P1000/1-  P500/M).  41 

16c  Estimated  probabilities  of  ceiling/ visibility 

800M  for  upper  Heyford,  England  using  the 
equation  P800/M  = P500/M  + B3(P1000/1  -P500/M).  42 

16d  Estimated  probabilities  of  ceiling/ visibility 

800M  for  upper  Heyford,  England  using  the 
equation  P800/M  = P500/M  + B4(P1000/1  - P500/M).  43 


v 


List  of  Illustrations 


Figure 

17 


18 


The  number  of  types  required  to  reproduce 
the  unconditional  probabilities  for  a 
given  RUSSWO  ceiling/ visibility  category 
for  an  unlimited  number  of  stations  for 
all  months  within  a given  maximum  allow- 
able percentage  error  (PC) 

The  numbers  indicate  which  of  the  31  types 
best  fit  the  diurnal  profile  of  the  listed 
stations  for  a given  month.  The  criteria 
are  ceilings  less  than  500*  and/or  visi- 
bilities greater  than  1/2  mile 


List  of  Illustrations 


Table  Pape 

I Percentage  of  times  when  CMSI gave 

larger  errors  than  the  Billiken-factor 

method  during  winter 31 

II  Verification  statistics  for  select  routes 

which  helDed  provide  the  original  data 
used  to  devise  equation  31 33 

III  The  mean  errors  (in  per  cent)  between  the 

revised  analytic  Billiken-factor  method  and 

the  true  CMSI  during  January 3 5 

IV  Twenty  five  stations  used  in  initial  study 39 


vii 


PERSONNEL 


j 


t 


J 

r 


The  following  members  of  the  Department  of  Earth  and 
Atmospheric  Sciences  were  engaged  in  the  research  program 
at  various  stages.  Professor  Donald  E.  Martin  was  the 
Principal  Investigator. 

James  P.  Long  did  much  of  the  pioneering  work  on  the 
CMSI  portion  of  the  report.  He  was  aided  by  John  Piess, 
Terry  Phelps  and  William  Talman  in  devising  the  analytic 
functions.  James  Wilson  and  Lorraine  stickel  contributed 
materially  to  all  aspects  of  this  research  during  its 
formative  stages.  The  verification  of  the  CMS!  formu- 
lations was  performed  by  Dale  Johnson  and  George  Jacroux. 

Robert  Bishop  and  James  St.  John  were  active  in  the 
RUSSWO  modelling  efforts  during  the  early  stages  of  that 
work.  Frank  Holt  and  Richard  Picanso  advanced  this  effort 
to  the  point  reported  in  this  document.  Eloise  Myers  and 
A1  Kreiner  conducted  research  in  various  areas.  The  re- 
gression equation  formulations  of  this  report  represents 
but  one  of  their  many  contributions. 

Numerous  students  participated  on  a less  extended 
basis.  Primary  among  these  were  Ron  Przbylinski , Gene 
Arbogast , Robert  Thiele,  Harvey  Ferdman,  Alan  Kikawa  and 
Gordon  Neithe. 

Mr.  Gringorten  of  AFGL  was  very  helpful  in  reviewing 
this  report  and  providing  valuable  contributions  with  re- 
gard to  its  content.  Albert  Boehm  of  AWS  provided  guidance 
to  the  project  through  frequent  consultations  with  the 
Principal  Investigator.  Mrs.  Frances  Brummell  served  as 
secretary  throughout  the  entire  course  of  the  research. 


i 


viii 


m .4  + 


INTRODUCTION 


Recent  innovations  in  Air  Weather  Services  have 
emphasized  the  need  for  modelling  climatic  data  in  order 
to  transfer  much  of  the  information  presently  stored  in 
book  or  page  form  into  the  computer  for  rapid  retrieval 
purposes.  This  presents  a host  of  challenges  to  the 
statistically-oriented , meteorologically-knowledgeable 
scientist  since  he  has  very  little  precedence  to  follow 
in  these  efforts.  Since  this  combination  of  talents 
is  rather  unique  to  anv  given  individual,  a team  of 
students  working  under  the  guidance  of  Professor  Martin 
has  been  assembled  to  attack  the  problem.  In  addition, 
a CMSI  coordinating  team  comprised  of  experts  from 
AFGL,  AWS  and  SLU  meets  twice  yearly  to  guide  the  over- 
all effort  arid  supplement  the  talents  of  the  Saint  Louis 
University  contingent. 

The  first  goal  of  this  research  was  to  model  joint 
ceiling  and  visibility  relationships  among  stations  as 
a function  of  distance  and  season.  We  have  system- 
atically progressed  through  four  phases,  each  of  which 
has  definite  merit  over  the  assumption  of  independency. 
The  end  result  is  a simple  modelling  method  which  pre- 
sents a ready  estimation  of  joint  probabilities  as  a 
function  of  season  and  distance.  The  analytic  formu- 
lations which  we  have  devised  are  applicable  for  any 
route  irrespective  of  direction,  topography  or  location. 
Verification  of  these  methods  on  both  dependent  and  in- 
dependent data  attest  to  their  merit  and  simplicity. 

A second  phase  of  the  research  involved  the  problem 
of  data  compaction.  The  unconditional  probabilities, 
in  particular,  represent  a massive  array  which  would 
quickly  saturate  the  largest  of  computer  storage  systems. 
By  taking  advantage  of  inherent  joint  relationships 
among  the  statistics  of  adjacent  points  in  the  RUSSWO  , 
diurnal  and  annual  imposed  trends,  and  through  the 
application  of  analysis  comoartmentization  techniques, 
we  have  found  it  possible  to  compact  the  data  by  several 
orders  of  magnitude. 

To  date  cur  efforts  have  been  confined  to  categories 
which  carry  combined  ceiling/ visibility  restrictions. 

The  procedures  developed  pertain  to  separate  considera- 
tions of  these  parameters  as  well.  Thus,  we  plan  to 
process  the  ceiling/ visibility  data  (either  separately 
or  in  combinations  as  the  need  requires)  for  some 
360  stations  where  RUSSWO  data  are  available.  This  will 
offer  an  on-line  capability  for  retrieving  unconditional 
probability  data  for  "any"  desired  point  over  the 
Northern  Hemisphere. 


MISSION  SUCCESS  INDICATORS 


The  mission  success  indicator  (MSI)  specifies  the 
probability  that  a particular  mission  can  be  success- 
fully completed.  Since  many  factors  in  addition  to 
weather  often  enter  into  its  calculations,  our  research 
represents  a restricted  version  of  the  MSI.  It  will 
henceforth  be  denoted  by  the  symbol,  CMSI.  It  is  the 
probability  that  a specific  mission  will  experience 
favorable  weather  conditions  based  entirely  upon  clima- 
tological considerations.  Such  information  offers 
valuable  inputs  into  both  the  planning  and  operational 
phases  of  military  missions. 


1)  Notations 

P(Au)  = Unconditional  probability  that 
point  A is  up  (above  specified 
minimums)  at  the  desired  time. 

P(Bd)  = Unconditional  probability  that 

point  B is  down  (below  specified 
minimums)  at  the  desired  time. 


PAdUBd  = The  union  of  the  two  unconditional 
probabilities . 

P(Au^Bu)  = The  intersection  of  the  two  uncon- 
ditional  probabilities. 


By  definition,  P(Au)  = 1 - P(Ad).  Thus  whether  the 
available  unconditional  probabilities  for  a station 
are  presented  as  being  above  or  below  minimums  is 
immaterial  since  a simple  conversion  can  be  made 
from  one  statistic  to  the  other. 


2)  Definitions  and  Examples 

a)  CMSI  Based  Upon  the  Assumption  of  Independency 
(CMSI. ) 

One  method  for  producing  the  CMSI  is  to  assume  in- 
dependency and  multiply  together  the  respective  un- 
conditional probabilities  that  each  station  used  in 
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the  flight  will  be  above  specified  minimums  at 
the  appropriate  take-off  and  landing  times.  The 
unconditional  probabilities  required  for  such 
computations  have  been  tabulated  by  hour  (or 
other  time  increment)  for  virtually  all  air  bases, 
airports  and  weather  stations.  They  have  been 
widely  distributed  and  are  commonly  available  for 
most  locations  where  CMSI's  are  desired. 

Example  1.  Assume  a departure  from  point  A,  with 
stopovers  at  points  B and  C,  and  a final  destin- 
ation at  point  D.  Ceiling/visibility  minimums  are 
5C0  ft/1.0  mile  at  all  points.  The  stopovers  are 
of  sufficiently  short  duration  that  the  uncon- 
ditional probability  of  being  above  minimums  at  B 
and  C are  the  same  for  both  arrival  and  departure 
times.  Suppose  the  following  unconditional  proba- 
bilities pertained: 

P(Au)  = .85  P(Bu)  = .80  P(Cu)  = .67  P(Du)  = .72 
Then,  by  this  method  of  computing  the  CMSI^, 

CMSIi  = P(Au)  X P(Bu)  X P(Cu)  X P(Du)  (1) 

= .85  X .80  X .67  X .72 
= .328  = 32.8% 

This  represents  a perfectly  valid  method  of  calcu- 
lating the  CMSI  IF,  AND  ONLY  IF  the  unconditional 
probability  at  each  point  is  independent  of  all 
the  other  unconditional  probabilities. 

b)  CMSI  which  Allow  for  Data  Dependencies 

The  true  CMSI,  i.e.,  CMSIt,  denotes  the  climatic 
probability  of  a successful  mission  whereby  no 
assumptions  are  made  with  regard  to  data  dependen- 
cies. Its  calculation  requires  a knowledge  of  the 
respective  unconditional  probabilities  and  their 
joint  relationships. 

CMSIt  = P(AuABuKCu*Du).  (2) 

CMSIt  = CMSI^  = P(Au)  X P(Bu)  X P(Cu)  x P(Du) 

if,  and  only  if,  the  unconditional  probabilities  of 
stations  A,  B,  C,  D are  independent  of  each  other. 

The  following  formula  pertains  when  the  respective 
unconditionals  of  two  stations  are  not  independent 

P(XaY)  = P(X)  + P(Y)  - P(XUY). 
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This  is  extended  to  several  stations  as  follows: 

(Here  A,B,C,D  = probability  of  that  station  being  up 
at  right  time,  i.e.  A = P(Au),  etc.) 

CMSIt  = AaB*CaD  = J(AaB)ac]aD  = [(A+B-AUB)ac]  A D (4) 

= ^A+B-AUB)+C-(A+B-AUB)UCJ A D 
= [(A+B-AUB+C-AUC-BUC-AUBUOJ  A D 

= ( A+B-AUB+C-AUC-BUC+AUBUC)  + D -( A+B-AUB+C-BUC-AUC+ 
AUBUOUD 

= A+B-AUB+C-AUC-BUC+AUBUC+D-AUD-BUD+AUBUD-CUD 
+AUCUD+BUCUD-AUBUCUD. 


CMSI  = A+B+C+D-AUB-AUC-BUC-AUD-BUD-CUD+AUBUC+AUBUD+ 

T AUCUD+BUCUD-AUBUCUD  . 

It  is  readily  seen  that  the  true  CMSI  becomes  quite  com- 
plex when  a number  of  dependent  stations  is  involved. 
Statistical  evaluation  of  CMSI  by  "brute  force"  sequential 
processing  of  data  for  just  two  stations  is  time  consuming 
even  on  the  fastest  computers.  Processing  multi-station 
routes  with  variable  lag  times  on  a real  time  request  basis 
is  virtually  out  of  the  question.  The  effects  of  data 
dependencies  essentially  work  to  make  CMSI-t"£ CMSIi • The 
net  result  is  most  apparent  when  the  unconditional  proba- 
bilities of  restricted  conditions  are  large  and  the  stations 
are  located  relatively  close  to  each  other.  This  will  be 
illustrated  by  an  example. 

Example  2 . Assume  that  the  distance  from  A to  B is  50  miles, 
B to  C 400  miles,  and  C to  D 25  miles.  Further  assume  (for 
the  moment)  that  stations  separated  by  distances  greater 


Now  A and  B are  independent  of  C and  D.  But  A and  B are 
dependent  and  C and  D are  dependent.  Then,  from  (2)  and 
(4): 

CMSIt  = P(AaBaCaD)  = P [(AaB)a(CaD)]  = P(AaB)  times  P(CAD).(5) 
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From  (3),  P(AaB)  = P(A)  + P(B)  - P(AUB)  (6) 

But  P(AUB)  is  the  probability 
that  at  least  one  of  A or  B 
is  upl  It  is  readily  seen  in 
Fig.  1 that  P(AUB)  must  be 
larger  than  either  of  the  two 
unconditional  probabilities 
and  that  P(A^B)  must  be 
smaller  than  either  of  the 
unconditionals  if  the  two 
are  dependent  but  not  co- 
located. In  accordance  with 
the  above,  let  us  arbitrarily 
assume  P(AUB)  = .90  as  a 
value  reasonably  in  excess  of 
P(A)  and  P(B)  for  the  first 
part  of  the  leg  and  P(CUD)  = 

.78  as  one  meeting  similar 
criterion  for  the  latter 
portion  of  a flight. 

From  (6)  P(A*B)  = .85  + 80  - .90  = .75. 

P(CAD)  s .67  + .72  - .78  = .61 . 

From  (5)  CMSIt  = P( A^B ) X P(CAD)  = 75  x .61  = 

.458  = 45.8%  . 

This  is  an  increase  of  13%  over  the  CMSIi  found  in 
example  I.  Increasing  the  number  of  stations  involved 
would  further  magnify  the  difference  between  CMSIi  and 
CMSI-t — especially  if  the  stations  were  sufficiently 
close  together  so  that  their  joint  climatologies 
assume  relationships  in  excess  of  random  occurrences. 

The  above  two  arbitrary  examples  serve  to  illustrate 
the  important  point  that  whenever  stations  are  located 
sufficiently  close  together  to  exhibit  dependently 
related  climatologies,  the  assumption  of  independence 
leads  to  CMSIi's  that  are  sufficiently  pessimistic  to 
be  misleading  to  a decisionmaker 

3)  CMSI^  Modelling  Efforts 

Our  research  efforts  have  been  restricted  to  modelling 
relationships  of  joint  probabilities  between  two 
different  stations  with  respect  to  specified  visi- 
bility and  ceiling  limitations.  A data  base  was 


Fig.  1.  Intersection 
and  Union. 


acquired  from  ETAC/AWS  which  contained  the  computer 
processed  data  necessarv  for  these  modelling  efforts 
for  39  sets  of  three -st at-’ on  combinations  distributed 
throughout  the  Northern  Hemisphere.  Computer-traced 
CMSI-t’s  were  computed  from  these  data  and  tabulated 
with  the  respective  unconditional  probability  values. 

The  latter  were  used  to  calculate  CMSIi  estimates. 
Differences  between  CMSTt  and  CMSIi  provide  a direct 
measure  of  the  joint  dependencies  between  the  two  re- 
spective stations. 

a)  The  K-factor  was  proposed  to  measure  this  de- 
pendency . Tts  numerical  value  when  multiplied  by 
CMSI^  yields  the  true  CMSI: 

CMSI  = CMSIi  X K = P(AU)  X P(Bu)  X K 

= jl  - P(Ad)J  X ^1  - P(Bd)][  X K. 

Or,  solving  for  K, 

CMSIt 

K - p.  - 

It  was  initially  anticipated  that  the  direction  of  flight 
from  A to  B would  have  a significant  bearing  on  the 
magnitude  of  K.  To  verify  this,  A was  taken  as  the  origin 
(center)  in  Fig.  2.  The  direc-'  ’on  and  distance  to  B was 
aligned  with  respect  to  that  poxnt  and  the  K-factor  was 
plotted  and  analyzed.  The  analysis  showed  some  northeast- 
southwest  elongation  as  the  K-factor  approaches  1.000 
(independence).  However,  as  the  distance  decreases,  the 
isolines  become  almost  circular.  This  led  to  the  con- 
clusion that  the  climatic  joint-dependencies  are  virtually 
independent  of  direction.  At  very  close  range,  CMSI-j-  was 
found  to  be  quite  sensitive  to  errors  in  the  K-factor. 
Insights  gained  from  a study  of  the  K-factor  fields  led 
to  the  development  of  other  more  promising  concepts. 

b)  The  M-Factor.  The  data  base  was  stratified  into  winter 
and  summer  seasons,  and  distance  groups  of  0-50,  50-100, 
100-200,  and  200-300  miles.  CMSI’s  were  plotted  on  the 
abscissa  and  CMSI-t’s  on  the  ordinate  of  a graph.  Their 
intersections  were  noted  by  dots.  A fairly  good  linear 
alignment  of  the  dots  emerged  within  each  distance 
range  (see  Fig.  3).  The  dots  were  oriented  close  to  the 
CMSIi  line  for  those  stations  which  were  very  far  apart 
and  nearly  independent.  The  more  the  stations’  data 
were  correlated,  the  closer  the  dots  became  oriented  to 
the  ’’perfectly-correlated”  line. 
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The  limits  of  the  slope  of  the  linear  relation  were  con- 
fined between  the  perfectly  correlated  and  uncorrelated 
representations.  Plots  of  the  500/1  data  and  those  of 
the  2OO/V5  data  were  compared.  Similar  orientations  were 
found  with  respect  to  the  best-fit  lines  connecting  the 
respective  arrays  of  dots.  The  significance  of  this 
finding  is  that  a common  M-value  cam  be  used  for  the 
1000/1  and  500A  criteria.  Presumably  this  commonality 
would  pertain  to  other  ceiling/visibility  categories  as 
well . 


The  M- factor  was  defined  as  follows: 
CMSIt  = CMSI f + (1  - CMSIi)  X M. 
Solving  for  M, 

CMSI . - CMSI. 

M = * i • 

1 - CMSI. 

1 


(9) 

(10) 


Fields  of  M-factor  for  500/1  and  200/^  were  generated  for 
all  months,  for  distances  from  0 to  250  miles  (see  Fig.  4). 
The  fields  were  smoothed  in  both  time  and  space  dimensions. 
They  can  be  easily  analyzed  and  are  consistent  with 
meteorological  reasoning  in  that  the  M-factor  decreases 
with  distance,  so  that  CMSIt  CMSIi.  The  annual  vari- 
ation is  cyclic  allowing  a smooth  trams ition  from  month 
to  month. 

c)  The  Alpha  Factor.  The  alpha-factor  method  was  a natural 
outgrowth  of  the  K- and  M-factor  research.  The  data  base 
was  again  stratified  into  summer/winter  seasons,  and 
mileage  ranges  of  0-50,  50-100,  100-200,  and  200-300.  An 
annual  (all  months)  category  for  each  mileage  range  was 
also  included.  P(Au)  s 1 - P(Ad)  was  put  on  the  ordinate 
and  P(Bu)  = 1 - P(Bd)  on  the  abscissa.  CMSI^  values  were 
plotted  at  the  intersection  of  the  respective  sets  of 
unconditional  probabilities  (see  Figs.  Sa,  5b,  5c  and  5d). 

The  smoothed  CMSIt  fields  were  easily  analyzed  and  limits 
of  the  isolines  were  established.  The  isoline  could  be 
no  closer  to  the  origin  them  the  solid  quasi-diagonal 
line  connecting  the  two  unconditionals  (the  isoline  of 
independence)  and  could  be  no  further  away  from  the  origin 
than  the  square  region  outlined  by  the  axes  and  the 
vertical  and  horizontal  lines  of  two  equal  conditionals. 

The  distance  on  a radial  from  the  origin  between  the 
CMSI^  diagonal  and  the  CMSIt  curve  is  a measure  of  the 
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Unconditional  probability  for  landing  station  one  hour  later 

Fig.  5a.  Graph  of  CMSI-f-  (heavy  curved  lines)  for  Lakenheath-Mildenhall  and  Mildenhall 
Lakenheath  comprising  the  distance  interval  of  0-50  miles  for  all  months  of 
the  year.  The  dashed  lines  pertain  to  CMSIi.  The  numbers  along  the  co- 
ordinate axis  pertain  to  either  set  of  CMSI  values. 
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Unconditional  nrobability  for  take-off  station 


increase  of  CMSIt  over  CMSIi  due  to  the  dependent  re- 
lationship be^-'en  stations.  Note  that  the  curvature 
of  the  isolines  decreases  with  increasing  distance  so  that 
CMSI-t"* CMSIi.  The  quantitative  measure  of  this  curvature 
change  is  also  a measure  of  the  change  of  dependence.  CMSI 
graphs  for  both  500/1  and  200/5$  were  produced  to  learn 
(as  was  the  case  for  the  M-factor  values)  that  a common 
graph  would  suffice  for  both  sets  of  ceiling/ visibility 
criteria. 


An  analytic  function  was  developed  to  capture  the  relation 
ships  contained  in  these  graphs.  To  do  this,  the  CMSIt 
field  was  set  up  on  an  X,  Y cartesian  coordinate  system 
with  X = P(Bd)  = 1 -P(Bu),  and  Y = P(Ad)  = 1 - P(Au) 

(see  Fig.  6).  The  field  was  assumed  to  be  symmetric  about 


Fig.  6.  Schematic  representation  of  the  Alpha-factor 


the  diagonal  (x  = y).  A hyperbolic  function  which  describes 
a curve  similar  to  the  CMSI.  isoline  is 


maximum 
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X = P(Bd),  Y = P( Ad) , CMSI . = 1 - Y 

t m 

value  within  a given  distance  range, 
root  from  (18)  yields 


and  <£>  is  a constant 
Applying  the  positive 


CMSIt  = 1 


v . P(Ad)  + P( Bd)  - 
m " 2 


-(p(Ad>)(P<Bd>) 


(19) 


A mean  was  calculated  for  each  distance  range  and  CMBIt 
fields  were  calculated  using  (19).  Agreement  with  the 
original  fields  was  quite  good.  In  summary,  given  P(Ad), 
P(Bd),  and  a table  of  ct  values,  estimated  CMSI^  could  be 
generated  by  analytic  formulation. 


u) 


The  next  step  was  to  formulate  the  Alpha-factor  as  a func- 
tion of  distance  and  month.  Station  combinations  with 
similar  separation-distances  were  grouped  and  CMSIt  granhs 
were  generated  for  December/ January  and  June/ July  for  a 
1-hr  lag.  The  limits  of  were  known  to  range  from 
0.5  1.0.  The  function. 


* = 


(20) 


was  found  to  be  a fairly  good  representation  of  the  curve  in 
Fig.  7 when  Y is  set  equal  to  70  for  Dec/Jan,  and  30 
for  Jun/Jul. 


The  function. 


50  + 20  cos 


(21) 


was  introduced  to  account  for  the  annual  variation  of 

Here  M identifies  the  month  with  January  = 1 and  December 
= 12  and  S represents  distance. 

iii)  The  complete  Alpha  function  then,  is 
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50  ♦ 20  cos 


(M-l  )7T 


-*(Y 

o(  = e + 1 


CMS  I 


P( Ad)+P( Bd)  - 


l + PCBdVT  - P(Ad)P(Bd). 


2ca 


Equation  22  permits  the  calculation  of  CMSI^  from  a 

knowledge  of  the  two  unconditional  probabilities,  the  separa- 
tion distance,  and  the  month.  This  involves  no  special 
graphs.  Only  trigonometric  and  exponential  tables  are 
needed. 


Eig.  7 . Values  of  «L  as  a function  of  distance  for  winter,  summer  and 
the  annual  mean.  The  numbers  in  the  drawing  refer  to  the 
number  of  stations  used  in  that  distance  range  to  produce 
these  values. 
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d)  The  Billiken  Factor  ( BF ) 


The  K-factor,  M- factor  and  •<  -factor  method  nrovide  ways 
of  modelling  joint  probability  relationships  as  a function 
of  distance  and  season.  Thev  do,  however,  have  certain 
shortcomings  that  can  be  circumvented.  The  K-factor  and 
M-factor  technioues  act  to  modifv  the  assumption  of  in- 
dependency. No  attempt  is  made  to  distinguish  the  relative 
magnitudes  of  the  unconditional  probabilities  involved. 

For  example,  a CMST^  of  81*  could  be  made  up  of  two  un- 
conditional probabilities  of  90%  each  or  one  of  100%  and 
another  of  81%.  In  the  first  instance  the  possibility  of 
a joint  relationship  of  10%  exists.  In  the  latter  case 
the  possibility  of  a joint-relationship  is  zero.  Thus 
the  K-factor  and  M-factor  method  work  best  under  certain 
implied  homogeneties  of  unconditional  probability  magni- 
tudes. They  fail  when  these  conditions  are  not  met. 

The  o<  -factor  method  remedies  this  situation.  It  demands, 
however,  that  a large  number  of  graphs  be  constructed.  A 
fifty-mile  range  is  much  too  large  in  the  proximity  of  a 
station  to  accurately  pin  down  an  exponential  decay  curve. 

A clue  to  an  alternative  solution  to  the  problem  is  found 
in  figure  3.  Here  the  distance  between  the  perfectly  cor- 
related line  and  the  CMFI^  line  represents  the  maximum 
joint  dependency  possible  between  two  stations  under  a given 
set  of  unconditional  probabilities.  It  is  important  to 
note  that  the  smaller  of  the  two  unconditional  probability 
values  (1-PA)  and  (1-PR)  also  defines  this  same  maximum 
joint  relationship.  To  illustrate  this  point,  let  us  ex- 
amine two  points  on  curve  A,  one  where  the  independent 
assumption  as  designated  on  the  abscissa  is  76  and  the  other 
where  it  is  86.  Be  note  that  curve  A is  spaced  about  4 graph 
units  from  the  CMFT^  line  at  the  76%  point  and  that  the 
total  width  of  the  sector  of  maximum  joint  relationship  at 
that  interval  is  some  5V  units.  Hence  the  ratio  of  the 
observed  joint  dependency  to  the  maximum  is  some  70%.  At 
the  point  where  CMFI  is  86%,  the  sector  is  some  3V.  units  wide 
and  curve  A takes  up  2V,  units  of  that  value.  This  also  repre- 
sents a ratio  of  approximately  70%.  Thus  somewhat  ir- 
respective of  the  magnitude  of  the  maximum  possible  joint 
relationship,  30%  of  that  value  decays  off  in  the  first 
fifty  miles. 

At  the  same  points  with  respect  to  curve  B the  ratios  be- 
tween observed  joint  probabilities  and  their  maxima  are 
1-3/4  to  5-1/2  and  1 to  3-1/4  respectively,  i.e.,  approx- 
imately 30%  in  each  case.  This  factor  (or  ratio)  has 
come  to  be  called  the  Billiken  factor  (BF).  It  can  readily 
be  determined  from  the  data  base  of  each  and  every  station 
thereby  eliminating  the  problem  of  constructing  voluminous 
networks  of  graphs  to  model  the  decay  of  joint  dependencies 
with  distance. 
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The  CMST  is  the  nrobabilitv  that  both  A and  B are  ud 
or  one  minus  the  nrobabilitv  that  either  one  o^  A 
or  B is  down  (i.e.,  one  minus  the  DrobaSilitv  that  both 
will  not  he  un) . 


CMRIt  = P(AuaBu)  = 1 - P (AdUBd).  (23) 

From  ( 3) , 

P( AdUBd)  = PC  Ad ) + P(Bd)  - P(Ad^Bd).  (24) 

The  maximum  possible  value  of  the  intersection  is  the 
smaller  of  the  two  unconditional  probabilities.  Given 
two  unconditional  probabilities,  they  are  obviously 
either  equal  or  one  is  larper  than  the  other.  Let  us 
assume  P(Ad)i  P(Bd)  and  define  L such  that 

L X P( Bd)  = P(AdABd).  (25) 

Then  from  (24) 

P( AdUBd ) = P(Ad)+P(Bd)  -LX  P(Bd)  = P(Ad)+P(Bd)  X (1-L) 
and  from  (23) 

CMSIt  = 1 - fP(Ad)  + (1-L)  X P(Bdj). 

We  shall  define  the  Billiken  factor  as  BF  = 1 - L . (26) 

Than,  CMSIt  = 1 - P(Ad)  - BF  X P(Bd) 

= l-(larper  uncond.)-BF  X (smaller  uncond.).  (27) 

The  Billiken  factor  is  defined  as  that  numerical  value  which, 
when  multiplied  by  the  smaller  of  the  two  unconditional 
probabilities,  results  in  the  correct  CMSIt  in  (27). 

In  other  words , 

gp  _ l-(larper  unconditional)  - CMSIt 
(smaller  unconditional) 


or 


BF  = 


P( A ) 
u 


P(  A 


B ) 
u 


W 


= p‘AuA  V = 

P(Rj 


P ( A 


BdJ' 


(28) 


It  is  the  conditional  probability  that  A is  up,  piven  that 
B is  down.  Hence  it  is  to  be  expected  that  BF  will  be  zero 
when  the  distance  between  A and  B is  zero,  that  BF  will  tend 
to  the  limit  P(AU)  as  the  distance  increases  and  that  BF 
could  even  approach  1.0  for  a negative  relationship  between 
A and  B. 


jjfrJL,  4*- 
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An  analytic  function  has  been  fit  to  the  500/1  field. 

The  200/ij  function  would  be  somewhat  similar  except  for 
adjustments  in  the  empirical  constants  needed  to  fit 
the  data  in  the  lower  portion  of  figure  8 rather  than 
that  of  the  upper  part.  The  major  feature  of  the  Billiken 
factor  field  is  its  increasing  value  with  increasing 
distance  and  its  variation  with  month.  The  range  of  the 
Billiken  factor  is  from  0 to  1.0  but  a Billiken  factor 
value  greater  than  P(Au)  becomes  erroneous  unless  the 
joint  probabilities  between  two  stations  are  negatively 
correlated — a relationship  that  the  data  did  not  con- 
clusively substantiate. 


Mean  BF  values  for  Dec/Jan  and  Jun/Jul  from  Fig.  8 were 
plotted  against  distances  (see  Fig.  9).  These  curves 
can  be  approximated  by  the  function, 


BF  = 


1 


- e-S/M 


(29) 


where 'V  = 150  for  Dec/Jan  and  ^ - 64  for  Jun/Jul.  Plots  of 
BF  isolines  at  constant  mileage  against  month  (see  Fig.  10) 
reveal  an  annual  variation  which  is  not  centered  about 
the  Dec/Jan  - Jun/Jul  mean.  Instead,  the  cur re  is 
dropped  somewhat  below  this  mean  having  a value  of^ 

AJ  = 94  for  March,  April,  September,  and  October.  The 
maximum  amplitude  of  the  BF  monthly  variation  is  plotted 
in  Figure  11.  The  amplitude  is  bounded  by  zero  at  zero 
distance  and  tends  towards  zero  at  infinite  distance. 
Several  statistical  distribution  functions  have  these 
properties  (i.e.  , Poisson,  Chi  Square,  etc.).  However 
the  functionj 


Amp  - A • S 

max  B.expre/PT 

was  found  to  be  the  simplest  in  form.  Its  coefficients 
are:  A = 0.352,  B = 100,  C = 117.6. 


Thus, 


Amp  = 0.352  S 

max  106  exp(S/lTT.Fy 


( 30) 


The  cyclic  portion  of  the  monthly  variation  is  incorpor- 
ated by  introducing  the  function, 

cos  ^ M + 5 ) ttJ  . 

The  complete  analytic  expression  for  obtaining  Billiken 
factors  then  becomes 


BF 

BF 


(mean  value) 


(l-e-S/94> 


+ (max  amp)X( cyclic  variation) 
or 


0.352  S X cos  T(M+5)rr7  . 

100  expT37nTTB7  L 6""-* 


(31) 
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for  winter,  summer  and  the  annual  mean. 
The  crosses  (X)  denote  the  summer 
curve  obtained  from  observed  data  with 
the  adjacent  isoline  representing  the 
analytic  representation  for  that  season. 
The  dotted  values  next  to  the  Dec. -Jan. 
curve  pertain  to  the  observed  trace  and 
the  solid  line  to  the  analytic  formula- 
tion for  winter. 
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Fip.  10.  Billiken-factor  isolines  for  constant 
rileapes  plotted  apainst  the  nonth 
of  the  year. 
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Fig.  11.  Variation  of  the  maximum  amplitude  term  of  the  Billiken 
function  with  distance.  The  dotted  lines  represent 
the  actual  curve  which  the  function  is  attempting 
to  reproduce. 
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From  (27)  the  estimated  CMSI^  equals  1-Clarger  uncond)  - 
BF  X (smaller  uncond). 

Equation  31  permits  the  calculation  of  CMSI^'s  which 
incorporate  seasons  and  distance  dependencies  from 
merely  a knowledge  of  unconditional  probabilities  and 
distance  and  month. 

4)  Verifications  on  Independent  Data 

The  goals  of  this  verification  were  to  evaluate  the 
three  most  promising  methods  of  estimating  CMSI*  and 
select  the  best  for  further  improvements.  The  first 
portion  of  that  goal  was  conducted  on  independent  data 
only.  The  improvement  phase  (second  step)  incorporated 
these  additional  data  into  the  formulae  to  better  tune 
the  coefficient  involved.  The  three  methods  to  be 
tested  were: 

1)  The  Alpha  method  of  equation  (22). 

2)  The  graphical  Billiken  factor  of  Fig.  8. 

3)  The  analytic  formulation  of  (2)  given 
by  equation  ( 3 ) . 

The  ceiling  and  visibility  criteria  of  500  feet/1  mile 
were  chosen  with  a 1-hour  lag  time  between  the  various 
pairs  of  station  combinations  with  distances  ranging 
from  1 to  306  miles.  Data  from  fifteen  different 
stations  were  used.  Ten  locations  were  in  Texas,  two  in 
California,  and  three  in  the  Washington,  D.  C.  area. 

These  data  were  not  used  in  deriving  the  analytical  func- 
tions. Januarv,  July  and  April  were  selected  to  provide 
data  representative  of  the  winter,  summer  and  transitional 
season.  The  statistics  for  these  months  were  nroduced 
on  the  Saint  Louis  University  computer  using  hourlv 
history  tapes  provided  bv  ETAC.  An  analysis  of  these 
independent  data  showed  the  graphical  Billiken-factor  method 
to  consistently  provide  the  best  estimation  of  the  true 
CMSI.  The  analytical  Billiken-factor  method  ranked  a close 
second  with  the  Alpha  method  placing  a rather  respectable 
last.  Graphical  repre sent at ions  of  the  analytic  Alpha 
and  Billiken-factor  verifications  are  shown  in  figures  12 
and  13  for  the  months  of  January  and  April. 

The  decrease  in  error-amplitude  between  January  and 
April  is  primarily  attributable  to  the  better  weather 
conditions  (fewer  frequencies  of  low  ceilings/ visibilities ) 


r 
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during  the  transition  months  with  respect  to  winter.  In 
fact,  during  July  there  were  so  few  occurrences  of  the 
500/1  criteria  for  the  stations  tested  for  either  method 
to  show  a significant  improvement  over  that  given  by  the 
assumption  of  independency.  In  other  words,  the  CMSIt 
estimations  best  show  improvements  over  the  independent 
assumption  method  when  the  weather  is  at  its  worse  and  a 
dependency  exists  between  the  climatologies  of  the  two 
stations.  Verifications  of  the  Billiken-factor  method  for 
generating  CMSI-j-'s  were  first  made  on  completely  inde- 
pendent data.  Table  I gives  a summary  of  the  verifi- 
cation results  for  the  fifteen  independent  stations. 


5)  Revised  Analytic  Functions 

The  verifications  of  Section  4 showed  certain  consistent 
biases  which  could  presumably  be  eliminated  by  adjusting 
the  coefficients  of  equation  31.  These  verifications 
also  exhibit  sufficiently  close  agreement  between  the 
graphical  Billiken-factor  method  and  its  analytic  repre- 
sentation for  us  to  select  the  analytic  function  method 
since  the  latter  can  be  much  more  readily  incorporated 
into  computer  oriented  procedures.  Figure  12  and  13 
show  a pronounced  bias  when  the  distances  between 
stations  are  short.  This  is  understandable  since  the 
coefficients  of  equation  31,  as  well  as  the  data  of 
Fig.  8,  were  based  upon  only  two  routes  (Lakenheath- 
Mildenhall  and  Mildenhall-Lakenheath)  in  the  0 to  50 
mile  distance.  To  minimize  this  bias,  the  formerly  in- 
dependent data  processed  by  Saint  Louis  University  for 
stations  located  close  together  were  now  added  to  the 
formerly  dependent  sample  obtained  from  ETAC  to  enlarge 
the  data  base  and  permit  a finer  tuning  of  the  coeffi- 
cients. Recalling  from  equation  31,  the  original  formu- 
lation was 


mean  value 


BF=l-exp(-S/94) 


max.  amplitude 


0.352  S 


cyclic  variation 


X cos 


f(m+5)7T\  (31) 


Based  upon  the  expanded  data  base  the  max.  amplitude 
function  was  changed  to 


0.0041 
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Route 

Distance  (nm) 

Month- January 

Washington,  D.  C.  - Bolling  AFB 

1 

100 

Bolling  AFB  - Andrews  AFB 

7 

100 

Washington,  D.  C.  - Andrews  AFB 

8 

88 

Randolph  AFB  - San  Antonio 

10 

100 

Kelly  AFB  - San  Antonio 

11 

100 

Randolph  AFB  - Kelly  AFB 

18 

100 

Ft.  Hood  - Waco 

45 

100 

San  Francisco  - Sacramento 

75 

100 

Ft.  Hood  - Randolph  AFB 

95 

100 

Kelly  AFB  - Ft.  Hood 

108 

100 

San  Antonio  - Corpus  Christi 

118 

88 

Ft.  Hood  - Dallas 

118 

100 

Reese  AFB  - Dyess  AFR 

131 

100 

Ft.  Hood  - Dyess  AFB 

132 

100 

Randolph  AFB  - Waco 

136 

100 

San  Antonio  - Waco 

140 

100 

Dyess  AFB  - Dallas 

154 

71 

Waco  - Victoria 

167 

100 

Dyess  - San  Antonio 

188 

50 

Randolph  AFB  - Dallas 

212 

25 

Reese  AFB  - Dallas 

265 

14 

Table  I.  Percentage  of  times  when  CMSI^  gave  larger  errors 
than  the  Billiken-factor  method  during  winter. 
Note  the  general  decay  of  these  percentages  with 
distance  as  the  joint  relationships  between  sta- 
tions approach  independency.  Here  a criterion 
that  each  of  the  unconditionals  must  exceed  2% 
was  invoked  to  enhance  the  reliability  of  the 
statistics . 
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The  first  tern  in  the  revised  version  of  equation  31 
was  found  to  he  nearly  a straight  line  when  plotted 
on  semi-lop  naper  which  could  be  represented  bv  the 
equation  0.216  In  S - 0.347  where  S is  arbitrarilv 
bounded  bv  7 ^ R ^ 400  miles.  ^hese  limits  impose  no 
restrictions  on  the  usages  of  the  function  in  actual 
practice  since  values  in  excess  of  400  miles  can  be 
assumed  to  be  independent.  The  Pilliken  factor  for 
the  seven  mile  range  was  used  for  distances  ranpinp 
from  0 through  6 miles  as  well  to  avoid  blow-ups  as 
the  function  in  equation  32  approaches  zero. 

The  complete  revised  analytic  function  then  becomes 

BF  = O.^IB  In  S - 0.347  + 0.0041  S x cos  ( (m+5)T A.  (32) 

exp(B/l!5)  6 / 


6)  Verifications  on  Dependent  Data 

The  analytic  formulation  was  tuned  to  encompass  the 
independent  station  pairs  listed  in  Table  I and  the 
original  39  pairs  of  stations  used  to  derive  eauation 
31.  Table  II  presents  verification  statistics  on  a 
representative  portion  of  the  originally  dependent 
sample.  This  table  shows  that  certain  routes  of  short 
distances  such  as  Travis  to  Alemeda,  for  instance,  fit 
the  analytic  function  less  well  than  others.  Often 
weather  at  one  station  comprising  the  route  may  be  ex- 
tremely localized  due  to  orographic  or  topographic  effects. 
Hence  its  dependency  characteristics  with  respect  to 
other  stations  represent  an  anomaly  from  the  climatic 
group.  In  such  instances  forecast  MSI's  should  be 
more  heavily  relied  upon  than  their  modelled  climatic 
counterparts.  It  should  be  possible  to  identify  such 
anomalous  routes  for  special  consideration  in  the 
planning  decision  process  from  a knowledge  of  the 
location  and  topography  of  the  stations  involved. 

Table  III  is  based  upon  January  data  and  provides 
magnitude  estimates  of  the  improvements  over  the  inde- 
pendent assumption  method  made  by  using  the  revised 
Billiken-factor  techniques. 

7)  Conclusion 

The  analytic  functions  of  either  equations  31  or  32 
represent  a convenient  method  for  estimating  CVSIt 
without  having  to  resort  to  tracing  the  data  throughout 
the  records  of  the  various  stations  involved.  Either 
method  can  be  expected  to  consistently  show  improved 
results  over  the  assumption  of  independency,  particu- 
larly when  stations  are  situated  close  together  and 
exhibit  high  frequencies  of  non-localized  bad  weather. 
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Route 


Distance  (nm)  Month-January 


Mildenhall-  Lakenheath 
Travis  AFR  - Alameda 
Ft.  Hood  - Bergstrom  AFR 
Ramstein  - Rhein  Main 
Andrews  AFB  - Dover  AFB 
W.  Robins  AFB  - Dobbins  AFB 
Altus  AFB  - Tinker  AFB 
Pope  AFB  - Charleston 
Charleston  - W.  Robins 


4 

100 

34 

6 5 

52 

100 

52 

100 

68 

100 

90 

100 

102 

100 

145 

71 

181 

88 

Table  II.  Verification  statistics  for  select  routes  which 
helped  provide  the  original  data  used  to  devise 
equation  31.  The  numbers  denote  the  percentage 
of  times  when  CMSI-  gave  larger  errors  than  the 
Billiken-factor  method.  A criterion  was  invoked 
to  enhance  the  reliability  of  the  statistics. 


Afit'  4u  -4  + 


■**  V 


-34- 


The  research  so  far  pertains  to  flights  between  two 
stations.  As  more  stations  become  involved  the  method 
will  need  to  specify  when  to  use  the  independent 
assumption  and  when  to  assume  dependency  relationships 
in  a manner  somewhat  akin  to  that  shown  bv  example  2 
of  this  report.  The  details  of  these  incorporations 
will  constitute  the  thrust  of  subsequent  research 
efforts . 
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Route 

Washington,  D.C.  - Bolling 

Bolling  - Andrews 

Washington,  D.C.  - Andrews 

Randolph  - San  Antonio 

Kelly  - San  Antonio 

Randolph  - Kelly 

Ft . Hood  - Waco 

San  Francisco  - Sacramento 

Ft.  Hood  - Randolph 

Kelly  - Ft.  Hood 

San  Antonio  - Corpus  Christi 

Ft.  Hood  - Dallas 

Reese  - Dyess 

Ft.  Hood  - Dyess 

Randolph  - Waco 

San  Antonio  - Waco 

Dyess  - Dallas 

Waco  - Victoria 

Dyess  - San  Antonio 

Randolph  - Dallas 

Reese  - Dallas 


Distance 

(nm) 

Col.  1 

Col. 

1 

0.4 

CO 

• 

CM 

7 
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00 

3.7 

8 

0.5 
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10 

1.2 

8.1 

11 

0.5 

7.6 

18 

to 

• 

o 

7.0 

45 

O') 

• 

c 

5.7 

75 

0.5 

2.5 

95 

1.2 

5.0 

108 

1.5 

4.4 

118 

1.0 

3.1 

118 

0.7 

2.9 

131 

0.9 

2.4 

132 

1.4 

3.3 

136 

1.8 

4.8 

140 

1.2 

4.1 

154 

to 

• 

o 

1.6 

16  7 

1.4 

4.2 

188 

0.4 

CM 

• 

rl 
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1.6 

2.7 

265 
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Table  III.  The  mean  errors  (in  per  cent)  between  the  revised 
analytic  Billiken-factor  method  and  the  true  CMSI 
during  January  are  shown  in  Column  I.  The 
mean  errors  (in  per  cent)  between  the  independent 
assumption  method  and  the  true  CMSI  are  similarly 
given  in  Col.  2. 
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III.  MODELLING  THE  RUPSWO  DATA 

Attempts  were  made  to  model  the  ETAC  Revised  Uniform 
Summary  of  Surface  Weather  Observations  (RU5SW0)  to  compact 
the  voluminous  information  on  ceilings  and  visibilities 
which  they  contained  into  a minimum  of  computer  statements. 

We  were  successful  in  this  attempt  since  the  cumulative 
alignment  of  these  respective  unconditional  probabilities 
present  the  data  in  a form  amenable  to  isoline  analyses. 

These  analyses  exhibited  the  following  useful  characteristics . 

1)  The  isolines  for  any  given  month  and  hour  are 
relatively  smooth  and  evenly  spaced. 

2)  Their  magnitudes  and  gradients  show  cyclic 
annual  and  diurnal  variations. 

These  two  features  allow  for  data  interpolations  and 
deductions  which  lie  at  the  heart  of  the  problem  we  set  out 
to  solve.  For  example,  the  abbreviated  presentation  shown 
in  figure  14  contains  a wealth  of  information  about  that  sta- 
tion. Here  one  particular  ceiling/ visibility  category 
(1000/1)  has  been  extracted  from  the  RUSSWO  by  month  and  hour. 
Note  the  annual  trend  to  the  unconditional  probabilities  from 
January  to  July  and  the  narked  differences  in  diurnal  trends 
between  the  winter  and  summer  season.  A similar  presentation 
is  shown  in  figure  15  for  a less  restrictive  ceiling/ visi- 
bility criterion  (500/%).  A comparison  of  these  two  figures 
shows  a large  decrease  in  ceiling/ visibility  probabilities 
from  the  higher  to  lower  category.  These  two  specific  ceil- 
ing/visibility criteria  were  chosen  since  they  lie  close  to 
the  diagonal  extending  outward  from  the  lower  right-hand 
comer  of  the  RUSSWO  sheet  occupied  by  the  zero  percent  bad 
weather  (100%  good  weather).  Consequently,  a knowledge  of 
these  particular  unconditional  probability  values  defines 
the  gradients  of  the  isolines  of  probability  of  occurrences 
to  permit  interpolations  to  obtain  values  for  nearby  cate- 
gories of  ceiling  and  visibilities.  That  is,  the  character- 
istic behavior  of  these  two  select  categories  reflects 
those  for  neighboring  categories-  as  well,  with  sufficient  ac- 
curacy to  obviate  the  need  for  modelling  a massive  array  of 
information  at  manv  points  to  deduce  the  field  arrav  o*  ceil- 
ing/visibility information  contained  in  the  RU9PWP.  uence, 
ceiling/ visibilitv  probabilitv  estimates  for  categories 
other  than  1000/1  and  500/%  were  deduced  by  regression 
equations  of  the 

Y = P (500/%)  - B [P  (1000/1)  - P (500/%)J  . 

Here  Y is  occurrence  probability  for  the  ceiling/ visibility 
category  desired  and  B is  an  empirically  derived  function. 

The  results  indicate  that  it  is  possible  to  estimate  reason- 
ably accurate  probabilities  for  any  category 4(1000/1 ) from 


Fifr.  14.  Unconditional  probabilities  of  ceilings  less  than  1000  ft  and/or 
visibilities  less  than  one  mile  at  Otis  AFO. 


Fig.  15.  Unconditional  probabilities  of  ceilings  less  than  500  feet  and/or 
visibilities  less  than  one-half  mile  at  Otis  AFB. 
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a knowledge  of  the 
16b,  16c  and  16d). 


>1000/1  and 


>500/Jj  values  (see  Figs. 


16a, 


Having  discovered  the  cyclic  nature  of  the  probability 
values  with  respect  to  hour  and  month  and  the  feasi- 
bility of  deducing  adjacent  categories  from  a knowledge  of 
two  strategically  chosen  ones , our  attention  was  diverted 
to  how  best  to  represent  this  information  in  computer  format. 
Here  choices  in  play-offs  between  accuracy  and  compactness 
are  involved.  Should  utmost  accuracy  be  required,  it  is 
suggested  that  values  at  each  of  the  eight  three -hourly  group- 
ing be  rounded-off  and  each  month's  data  stored  in  the  com- 
puter as  a multi -numbered  word.  For  example,  the  January 
values  at  Otis  Air  porce  Base  in  figure  14  would  be  entered  as 
the  sixteen  digit  word,  1718181817171617. 

The  arrays  of  figures  14  and  15  can  be  represented  by 
"types"  in  a still  more  compacted  form.  In  this  regard  we 
suggest  that  the  data  be  "normalized"  by  dividing  the  actual 
probabilities  at  each  time  grouping  bv  the  mean  of  the  month. 
By  dividing  out  the  mean,  which  is  an  indication  of  local 
effects  on  the  observed  weather,  emphasis  is  placed  on  the 
effects  of  the  solar  forcing  function  which  varies  with  lati- 
tude and  altitude.  This  latter  effect  can  be  represented  by 
generalized  types  whose  values,  when  multiplied  by  the  monthly 
mean,  reproduce  the  probabilities  for  the  particular  station 
involved. 

In  the  preliminary  stages  of  our  investigation,  data  were 
extracted  from  the  USAF/ETAC  Revised  Uniform  Summary  of 
Surface  Weather  Observations  (RUSSWO)  for  twenty-five  select 
stations.  Two  categories  were  examined:  ceilings  greater 

them  1000  ft  and  visibilities  greater  than  1 mile  and  ceilings 
greater  than  500  ft  and  visibilities  greater  than  1/2  mile. 
This  gave  96  probabilities  for  each  station  (the  data  for 
twelve  months  for  eight  3-hourly  periods).  One  such  array  for 
each  ceiling/ visibility  category  is  presented  for  Otis  Air 
Force  Base  in  figures  14  and  15.  Thus,  a total  of  300  data 
arrays  (12  months  x 25  stations)  were  available  for  typing  in 
the  preliminary  aspects  of  this  research.  The  stations  used 
are  listed  in  Table  IV. 


Chitose  AB,  Japan 
Chiang  Mai,  Thailand 
Kingsley  Fid. , Ore. 
Astoria,  Ore. 

Spokane,  Wash. 

Seattle,  Wash. 

Whidhey  Island,  Wash. 
Walla  Walla,  Wash. 
McGuire  AF3 , N.  J. 

Ft.  Campbell,  Ky. 

K.  I.  Sawyer  AFB,  Mich. 
Kincheloe  AFB,  Mich. 
Otis  AFB,  Mass. 


Lor in g AFB,  Me 
Great  Falls,  Mont. 
Andrews  AFB,  Md. 
Spanpdahlem  AF,  Germany 
Aviano  AB,  Italy 
Brunswick,  Ga. 

Memphis,  Tenn. 

Beaufort,  S.  Car. 

Minot  AFB,  N.  Dak. 
Elizabeth  Citv,  N.  Car. 
Scott  AFB,  111. 

Ft.  Riley,  Kan. 


Table  IV.  Twenty-five  stations  used  in  initial  study  . 
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Fig.  16a.  Estimated  probabilities  of  ceiling/ visibility  200/^5  for  upper  Heyford, 
England  using  the  equation  P200/^  = P500/?s  + B^tPlOOO/l  - P500/%).  jhe 
actual  values  are  plotted  underneath  for  comparison  purposes. 
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Fig.  16b.  Estimated  probabilities  of  ceiling/ visibility  2 00 /*ti  for  upper  Heyford 
England  using  the  equation  P200A;  = P500/ls  + B?  (P1000/1  - P500/H  ). 
actual  values  are  plotted  underneath  for  comparison  purposes. 
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Fig.  16c.  Estimated  probabilities  of  ceiling/ visibility  800/lj  for  upper  heyford, 
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actual  values  are  plotted  underneath  for  comparison  purposes. 
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A simple  model  which  takes  the  normalized  diurnal  un- 
conditional probability  curve  for  any  given  month  and  station 
from  the  test  data,  smooths  it  and  labels  it  as  Type  1 was 
used.  All  other  normalized  diurnal  unconditional  probability 
profiles  were  then  compared  with  that  type.  Those  profiles 
which  match  the  Type  1 curve,  within  a specified  tolerance, 
were  labeled  and  temporarily  removed  from  the  sample.  The 
procedure  was  repeated  by  arbitrarily  designating  another 
one  of  the  remaining  normalized  diurnal  profiles  as  TyDe  2 
and  matching  all  the  remaining  profiles  against  it.  Pro- 
ceeding in  this  fashion,  iterations  were  continued  until  all 
the  diurnal  profiles  for  all  stations  for  all  months  were 
typed.  A maximum  error  at  any  point  in  the  diurnal  profile 
of  ±2%  was  set  as  the  tolerance  for  fitting  these  types . 

That  is,  a given  unconditional  probability  must  be  repro- 
duced within  an  error  of  2.0%  at  every  grid  point  for  a type 
to  be  selected.  For  example,  if  the  true  value  is  10.3%  at 
any  given  grid  point,  the  regenerated  unconditional  proba- 
bility for  that  point  must  fall  between  8.3  and  12.3  percent. 

Figure  17  indicates  the  number  of  types  which  would  have 
been  required  to  reproduce  300  data  sets  within  certain 
error  tolerances.  A refinement  was  introduced  into  the 
procedure  after  all  types  have  been  established  to  be  sure 
that  a station's  data,  which  may  meet  the  tolerances  set 
for  more  than  one  type,  are  placed  in  the  one  which  it  fits 
the  best.  Further  refinements  contemplated  are  to  introduce 
other  criteria  for  determining  the  best  type  based  upon 
correlations  and  least  absolute  error  considerations.  Also, 
we  hope  to  reduce  the  number  of  types  shown  as  being  required 
in  figure  17  by  following  more  systematic  procedures  introduced 
by  Lund  whereby  the  first  type  chosen  is  representative  ot 
the  largest  number  of  fits,  the  second  type  the  next  largest 
number  of  fits,  etc. 


PC(%)  Ho.  of  Curve 

Types  Required 
for  500/^ 


No.  of  Curve 
Types  Required 
for  1000/1 


1.0 

86 

1.5 

44 

2.0 

31 

2.5 

23 

3.0 

15 

150 

95 

60 

34 

23 


Fig.  17.  The  number  of  types  required  to  reproduce  the 
unconditional  probabilities  for  a given  RUFFL'D 
ceiling/ visibility  category  for  an  unlimited 
number  of  stations  for  all  months  within  a given 
maximum  allowable  percentage  error  (PC). 
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The  various  types  which  fit  the  data  of  the  25  stations 
used  in  this  research  are  shown  in  figure  18.  These  types 
are  based  upon  the  criteria  that  the  ceilings  were  less 
than  five  hundred  feet  and/or  the  visibilities  less  than  one 
half  mile.  Although  it  is  envisioned  that  a number  some- 
what larper  than  thirty  might  be  needed  to  type  the  three 
hundred  sixty  some  stations  that  we  are  currently  processing 
for  all  months  of  the  year,  the  amount  should  remain  within 
manageable  bounds.  The  use  of  types  compacts  the  monthly 
data  for  any  one  station  for  two  ceiling  visibility  cate- 
gories into  8 digits  of  one  number. 

We  are  in  the  process  of  typing  the  monthly  profiles  of 
some  360  stations  for  which  we  have  extracted  RUSSl'O  data. 
These  results  will  be  nresented  in  a form,  amenable  for  ready 
operational  implementation  should  the  using  agencies  choose 
to  do  so.  Furthermore,  thev  should  lend  credence  to  our 
premise  that  the  unconditional  probability  data  can  be 
compacted  by  several  orders  of  magnitude  with  a minimum  of 
loss  in  integrity.  A hypothetical  example  of  this  might  be 
the  number  11070625.  Here  the  number  11  represents  twice 
the  value  of  5.5  for  the  1000/1  profile.  The  number  07  refers 
to  type  7 which  this  profile  has  been  found  to  match  by 
the  aforementioned  procedures.  The  number  06  refers  to 
twice  the  mean  value  for  the  500/h  category  and  25  designates 
the  type  that  matches  its  diurnal  profile.  We  have  chosen 
to  enter  twice  the  mean  value  to  minimize  round-off  errors 
occasioned  when  the  normalized  values  of  each  type  are  multi- 
plied by  that  mean  to  reproduce  the  station's  original  diurnal 
trace. 
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Fie.  18.  The  numbers  indicate  which  of  the  31  types  best  fit  the  diurnal  profile 
of  the  listed  stations  for  a given  month.  The  criteria  are  ceilings 
less  than  500'  and/or  visibilities  greater  than  1/2  mile.  The  match 
between  the  diurnal  profiles  and  those  of  the  "type”  must  be  within 
2%  at  every  grid  point. 
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